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ABSTRACT 

We measure the evolution of the X-ray luminosity-temperature (Lx — T) relation since z ~ 1 .5 
using a sample of 211 serendipitously detected galaxy clusters with spectroscopic redshifts 
drawn from the XMM Cluster Survey first data release (XCS-DR1). This is the first study 
spanning this redshift range using a single, large, homogeneous cluster sample. Using an 
orthogonal regression technique, we find no evidence for evolution in the slope or intrinsic 
scatter of the relation since z ~ 1.5, finding both to be consistent with previous measurements 
at z ~ 0.1. However, the normalisation is seen to evolve negatively with respect to the self- 
similar expectation: we find £ _1 (z)L x = 10 44 - 67±009 (r/5) ?04±o f6 (l +z)~ f - 5±0 - 5 , which is 
within 2a of the zero evolution case. We see milder, but still negative, evolution with respect to 
self-similar when using a bisector regression technique. We compare our results to numerical 
simulations, where we fit simulated cluster samples using the same methods used on the XCS 
data. Our data favour models in which the majority of the excess entropy required to explain 
the slope of the Lx — T relation is injected at high redshift. Simulations in which AGN feed- 
back is implemented using prescriptions from current semi-analytic galaxy formation models 
predict positive evolution of the normalisation, and differ from our data at more than 5a. This 
suggests that more efficient feedback at high redshift may be needed in these models. 

Key words: galaxies: clusters: general - galaxies: clusters: intracluster medium - X-rays: 
galaxies: clusters - galaxies: high-redshift - cosmology: observations 
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1 INTRODUCTION 

The evolution of the X-ray properties of galaxy clusters records 
both the assembly history of the most massive gravitationally 
bound structures in the universe and the thermal history of the in- 
tracluster medium (ICM). Both X-ray luminosity (Lx) and temper- 
ature (T) correlate with cluster mass, allowing the evolution of the 
cluster mass function to be measured, and constraints on cosmo- 
logical parameters, including the dark energy equation of state, to 
be obtained (e.g. Vikh linin et al.|2009[|Mantz et al.|2010) . To make 
further progress in the use of clusters as cosmological probes, it is 
necessary to develop our understanding of the physical processes 
which determine their observable properties. 

The physics that determines the properties of the ICM is more 
complicated than simply the action of gravitational collapse alone, 
which would result in clusters being approximately self-similar, 
and their observable properties obeying simple scaling relations 
with well understood redshift evolution. In the case of the Lx — T 
relation, self-similar evolution predicts Lx « T 2 ( |Kaiser|[l986| . 
However, it is well establis hed that the relation has a steeper slope, 
i.e. L x <* T 2 - 3 (e.g.|Edge & Stewart|l99l]|Markevitch|l998]|A7] 



naud & Evrard 1999, Vikhlinin et al. 2002; Maughan et al. 2006 



Pacaud et al. 2007; Pratt et al.|2009 Takey et al. 2011). This in- 



dicates an additional source of energy is heating the ICM, that is 
more effective in low mass systems. While some energy is injected 
by supernovae (SNe) within galaxies, it is likely that the bulk of 
the energy comes from Active Galactic Nuclei (AGN) in the cen- 
tres of clusters, as observations of low redshift clusters show that 
AGN jets, seen in radio imaging, carve out cavities in the hot gas 
observed at X-ray wavelengths (e.g. |Birzan et al.[ 2004 McNamara 
|et al.|2005||Blanton et al.|20TT) . 

Numerical simulations which include additional energy injec- 
tion into the ICM, such as from AGN feedback, are able to repro- 
duce the observed Lx — T relation at low redshift. However, differ- 
ent energy injection models, which give consistent results at low 
redshift, give different predictions for the evolution of the normal- 
isation of the Lx — T relation with redshift (e.g. Muanwong et al. 
[20061 |Short et al.|2010||McCarthy et al.|2011) . By measuring the 
evolution of the Lx — T relation to high redshift, constraints on 
these models can be obtained. This also feeds naturally into models 
of galaxy formation, which invoke AGN feedback to prevent over- 
cooling in massive haloes: a consistent model of AGN feedback 
should be able to reproduce the observed Lx — T relation as well as 
the galaxy luminosity function (e.g. |Bower et al.|2008} . However, 
to date there is no consensus on the evolution of the relation to 
high redshift: some studies find that the evolution is consistent with 
self-similar (e.g. [V ikhlinin et al.|2002||Lumb et al.|2004||Maughan| 
|et al.|2 006 ; Paca uTTeT^pOO?! , while other studies find evidence 
for either zero or negative evolution (e.g. Ettori et al. 2004 , Reichert 
|et al.|20fT||Clerc et al.|2012) . The X-ray cluster samples on which 
these works are based contain few clusters at high redshift, or are 
heterogeneous (i.e. containing objects drawn from many different 
surveys), making it difficult to account for selection effects, which 
can mimic evolution (e.g. Pacaud et al. 2007 , Shor t et al.|2010[ >. 

In this paper, we examine the evolution of the Lx — T relation 
over the last ~ 9 Gyr using the XMM Cluster Survey (XCsFllRomerl 



|et al.|2 001 ). XCS is a serendipitous search for galaxy clusters in 
the XMM-Newton Science Archive. The X-ray analysis method- 
ology for the survey is described in Lloyd-Dav ies et al.| ( |201 1| >. 
The first data release (XCS-DR1; Mehrte nsTt al.||2012) contains 
a total of 401 X-ray selected clusters with temperature and red- 
shift estimates, the largest such sample to date. The sensitivity of 



XMM-Newton allows XCS to detect a larger number of clusters 
at high redshift compared to earlier serendipitous cluster searches 
conducted with ROSAT; the XCS-DR1 catalogue contains 38 clus- 
ters at z > 0.5 with spectroscopic redshifts and temperature mea- 
surements. The most distant cluster in the sample is J2215. 9-1738 
at z = 1.46 ^Stanford et al.|2006|[ffilton et al.|2007l [20091 [20Tol >. 
In this work we use this wide redshift range to measure the evo- 
lution of the Lx — T relation, and therefore constrain models for 
energy injection into the ICM, such as AGN feedback (e.g. |Short| 
|et al.|2010} . |Stott et al.|p012) present a complementary study of 
the effect of AGN feedback in groups and clusters using a low red- 
shift (z < 0.3) subsample of XCS-DR1 clusters cross matched with 
the FIRST catalogue of radio sources jWhite et al.| [T997). Other 
analyses based on XCS-DR1 include a study of fossil groups and 
clusters (Harrison et al. 2012|, and |Viana et al.| < f2012^ describes 
the predicted overlap with the Planck Sunyaev-Zel'dovich effect 
selected cluster catalogue (Planck Collaboration 201 1 1. 

The structure of this paper is as follows. In Section[2] we pro- 
vide a brief introduction to the XCS-DR1 cluster sample used in 
this work. We describe the method used to measure the Lx — T 
relation and its evolution in Section [3] and present our results in 
Section]?] We discuss our findings in the context of numerical sim- 
ulations in Section[5] and present our conclusions in Section[6] 

We assume a cosmology of Q. m = 0.27, Q.^ = 0.73, and Hq = 
70 km s _1 Mpc~' throughout. 



2 DATA 



The XCS-DR1 cluster catalogue is presented in Mehrtens et al. 
(2012), while the algorithms used in generating the catalogue are 
described in detail in Llo yd-Davies et aL] ( [2011| LD11 hereafter), 
and so here we provide only a brief summary of the data used in 
this paper. 

XCS-DR1 is constructed from 5776 XMM observations, pub- 
licly available before July 2010. A total of 3675 extended X-ray 
sources (i.e. cluster candidates) were detected at > 4a significance 
with > 50 counts using a wavelet based detection algorithm, in an 
area covering ~ 410 deg 2 (see LD1 1). The majority of these cluster 
candidates have yet to be optically confirmed; the XCS-DR1 cata- 
logue consists of the first batch of 401 clusters with redshift and 
temperature measurements (see Mehrtens et al. 2012| l. 

X-ray luminosities and temperatures were measured for each 
cluster in XCS-DR1 using fully automated pipelines. The temper- 
ature measurements are described in Section 4.2 of LD11. Four 
different models, including one simulating the effect of undetected 
AGN contamination, and another simulating the effect of a cool 
core, were fitted to the spectral data using XSPEC ( |Arnaud|1996| >, 
with the best fitting model being adopted for the temperature mea- 
surement. X-ray luminosities were measured within R500 (i.e. the 
radius at which the enclosed mean density is 500 times the critical 
density at the cluster redshift), as described in Section 4.3 of LD1 1, 
by fitting the surface brightness profile using a (3 model l |Cavaliere] 
|& Fusco-Pe miano 19761 and extrapolating to R^qq where neces- 
sary. It is important to note that unlike dedicated follow-up observa- 
tions of known clusters (e.g. |Vikhlinin et al.|2006'}|Pratt et al.|2009[ 
Maug han et al.|2012[ >, the serendipitous data analysed by XCS is 
not of sufficient quality (i.e. low counts, low resolution due to de- 
tection off-axis) to excise emission from cluster cores. 



http: //www. xcs-home . org 




4 6 

T(keV) 

Figure 1. Redshift and temperature distributions for the sample of 211 
XCS-DR1 clusters with spectroscopic redshifts used in this work (see 
Mehrtens et al. 2012. for a description of the catalogue). 



In this work, we use only the subsample of 211 XCS-DR1 
clusters with spectroscopic redshifts. While all of the data used in 
this paper is publicly available in the form of the XCS-DR1 cata- 
logue 2 ) for completeness, the sample used here is listed in a supple- 
mentary table available with the online edition of this article. Fig.[T] 
shows the redshift and temperature distributions of the sample. The 
clusters span the redshift range 0.06-1.46 (median z = 0.28), and 
temperature range 0.6-9.8 keV (median T = 2.9 keV). Note that in 
the analysis presented in this paper, we do not attempt to correct 
for selection effects - given the redshift incompleteness of XCS 
(i.e., many candidate clusters within the survey area from which 
XCS-DR1 is drawn do not have optical follow-up or redshifts), ac- 
counting for selection biases is not straightforward, and is deferred 
to future work. We do however comment on the expected effect of 
Malmquist bias on our results for the Lx — T relation evolution in 
Sectionl5.ll 



3 ANALYSIS 

The large size of the XCS-DR1 catalogue allows us to simultane- 
ously fit for the redshift evolution of the Lx — T relation, in addition 
to its slope, normalisation and intrinsic scatter, using a model of the 
form 



log(£- I (z)L x )=A + Slog(r/5)+Clog(l+z) 



(1) 



Evolution of the Lx — T relation from XCS-DR1 

Table 1. Priors on Lx — T relation fit parameters (see Section]!). 



Parameter 


Uniform Prior 


Notes 


A 


(41,47) 




B 


(1,5) 




C 


(-3, 3) 




s 


(0.01,0.5) 


Orthogonal method only 




(0.01,0.5) 


Bisector method only 


°"i og r 


(0.01,0.5) 


Bisector method only 



where Lx is the bolometric X-ray luminosity measured within i?5oo 
in erg s and T is the X-ray temperature in keV. The advantage of 
this approach is that it avoids the need to bin the data by redshift. 
We set the pivot temperature to 5 keV for ease of comparison with 
other works (e.g. Pratt et al. 2009 1, although this is higher than the 
median temperature (T = 2.9 keV) of the sample. This model as- 
sumes that the slope of the relation does not evolve with redshift. 
Note that we have scaled the luminosities by E^ 1 (z) (the evolution 
of the Hubble parameter, i.e. E(z) = [flm(l +z) 3 +£2 A ] I//2 ), which 
is the evolution expected in the self-similar case, in which clusters 
are expected to become more luminous at fixed temperature as red- 
shift increases. Hence C = corresponds to self-similar evolution, 
while C < indicates evolution which is slower than self-similar. 

We estimate the parameters of this model using Markov Chain 
Monte Carlo (MCMC), using two different methods which both 
take into account the intrinsic scatter and the measurement errors. 
Our approach is similar to that of Weine r~et al.| (2006; see also 
Kelly 2007). Firstly, we define an orthogonal regression method, 
for which the probability density for a given cluster to be drawn 
from this model is 



^model : 



1 



^/2n(Ar 2 + S 2 ) 



exp 



('•-'•model) 



2(Ar 2 + 5 2 ) 



(2) 



where r — r mo( j e i is the orthogonal distance of the cluster from the 
model relation in the log Lx-log T plane; Ar is the error on the 
orthogonal distance, obtained from the projection in the direction 
orthogonal to the model line of the ellipse defined by the errors 
on log Lx, log T (appropriate sides of asymmetric error bars are 
chosen here according to the position of a given point relative to the 
model fit line); and 5 is the (orthogonal) intrinsic scatter. The latter 
can be converted into the scatter in the logLx axis (Oi gL x ) using 

Oi ogI , x =5/cos(tan- 1 B). (3) 

We also use a bisector method, in which the scatter and mea- 
surement errors in each axis are treated independently. In this case, 
^model is the product of the Gaussian probabilities of the residuals 
of Lx and T from the given bisector best-fitting line defined by the 
model parameters, i.e., we substitute 

-\og(E- l {z)L x )- 



^model : 



http: / /www. xcs- home . org/datareleases 



[A + Slog(r/5)+Clog(l+z)], (4) 

^modei=log(r/5)-[log(£- 1 (z)Lx)-A-Clog(l+z)]/S, (5) 

instead of r mo d e i in Equation [2] and replace r, Ar as appropriate. 
We replace S with two parameters, <Ti og £ x and Oi og j. 

For both methods, the likelihood L of a given model is sim- 
ply the product of -P mo del f° r each cluster in the sample, i.e., in the 
orthogonal case 

L(L X ,T\A,B,C,S) ~ P wim (A,B,C,S)l\P modeU , (6) 

i 

where we assume generous, uniform priors on each parameter, 
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REXCESS (Pratt et al. 2009), 




0.0 < z < 0.25 



XCS-DR1 (N = 77) 




XCS-DR1 (N=38) 




0.5 < z < 1.5 



T(keV) 



T(keV) 
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Figure 2. Lx — T relations for subsamples of XCS-DR1 in redshift bins. The dashed line shows the best fit to the data from the orthogonal method, while 
the dotted line shows the best fit according to the bisector method (see Section[3j. The red triangles in the left hand panel show the REXCESS sample jPratt| 
|et al.|2009] l for comparison. Neither the slope nor scatter change significantly with redshift using the orthogonal method, while we see shallower slopes at high 
redshift using the bisector method (see Table[2j. 



Table 2. Lx — T relation fit parameters, derived from the bisector and orthogonal methods (see Section|5J, for XCS-DR1 subsamples in redshift bins. The 
model fitted is log(£~ ' (z)Lx) = A + Blog(r/5), and the units of T and Lx are keV and erg s respectively. The uncertainties are the marginalised 68 per 
cent confidence regions on each parameter derived using MCMC. 









Bisector 






Orthogonal 




Redshift range 


N 


A 


B 


a logL x 


A 


B 


°logL x 


0.0 < z < 0.25 


96 


44.43 ±0.06 


2.81±0.14 


0.40 ±0.04 


44.63 ±0.10 


3.18±0.22 


0.33 ±0.04 


0.25 < z < 0.5 


77 


44.36 ±0.04 


2.45±0.14 


0.33 ±0.04 


44.47 ±0.07 


2.82 ±0.25 


0.23 ±0.04 


0.5 <z< 1.5 


38 


44.23 ±0.04 


2.17±0.19 


0.24 ±0.03 


44.28 ±0.07 


2.89 ±0.45 


0.24 ±0.05 



which are listed in Table Q] We obtain estimates of the model 
parameters from the posterior distributions using MCMC, imple- 
mented using the Metropolis et al. ( 1953 1 algorithm. 

As shown in the next section, for C = 0, the results given by 
the bisector and orthogonal methods are bracketed by those ob- 
tained when using the Kelly ( 2007 1 method, with T alternately used 
as the dependent or independent variable. It is important to note that 
there is no single method which gives the 'true' underlying slope 
and normalisation for problems with errors in both variables and in- 
trinsic scatter: each method gives a slope and normalisation which 
depends upon the assumptions in the method. Throughout this pa- 
per, we show the results from both methods, to give an idea of the 
possible systematic error arising from the choice of fitting method. 



4 RESULTS 

4.1 Evolution of the slope and intrinsic scatter 

The model for the evolution of the Lx — T relation defined in Equa- 
tion [T] assumes that there is no evolution in the slope of the rela- 
tion. We checked for this by fitting the Lx — T relation of subsam- 
ples divided into redshift bins (0.0 < z < 0.25, 0.25 < z < 0.5, and 
0.5 < z < 1.5), setting C = in Equation[T] Fig.|2]and Table|2]show 
the results. 

For the lowest redshift bin (z < 0.25), we find a similar slope 
using both the orthogonal and bisector methods to that found in 
many previous studies. The values we derive ar e consistent with t he 
value of 3.35 ± 0.32 measured by REXCESS^ ( Pratt et al.|2009) at 
z = 0.1, as well as numerous other works (e.g. Markevitc h|1998| 



|Arnaud & Evrard|1999|[Wu et al.|1999) , and most of the z < 0.3 
subsamples of XCS-DR1 clusters in the study by |Stottet aL] ( |2012| ), 
in which a different fitting technique was used. 

We find different values for the normalisation, depending on 
the fitting technique employed. For the orthogonal method, we find 
A = 44.63 ±0.10, which is slightly lower, but within 2a, of the 
REXCESS value (44.85 ± 0.06; |Pratt et al.|2009) . The normalisa- 
tion obtained using the bisector method (A = 44.43 ± 0.06) is about 
5a lower than the REXCESS value. This seems to be driven by the 
degeneracy between the slope and normalisation, with the orthogo- 
nal method preferring steeper slopes. As can be seen in the left hand 
panel Fig. [2] the bisector method gives more weight to a population 
of low Lx, but relatively high T objects, resulting in a shallower 
slope and correspondingly lower normalisation. 

Clearly, as shown in the left panel of Fig. [2] there is not much 
overlap between the XCS and REXCESS temperature ranges, so it 
is not surprising that there is some difference between the normali- 
sation s of the two samples. This may also be in part due to the use 
of the Cash \l919\ rather than % 2 statist ic in the XCS spectral fitting 
(LD 1 1 ; see also Hump hrey et al.|20 09 1, or reflect differences in the 
sample selection, if for example the XCS sample contains a smaller 
fraction of cool core clusters, which prefer a higher normalisation 
(see, e.g., Table 2 of |Pratt et alT] l [200"9"l >, where A = 45.11 ±0.16 
for cool core clusters, while A = 44.70 ± 0.03 for non-cool core 
clusters). 

We find that both the bisector and orthogonal fit results are 



3 We compare to REXCESS measurements with the core emission in- 
cluded (i.e., the L\, Ti values in Table 2 of |Pratt et al.|2009| . 
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Figure 3. The Lx — T relation for the 21 1 XCS-DR1 clusters with spectroscopic redshifts. The dashed line is the best-fitting four parameter model (Equation[TJ, 
determined using the orthogonal fitting method. The luminosities have been scaled to take into account the evolution in the normalisation as a function of 
redshift inferred from the best-fitting model parameters, as well as the (z) evolution expected in the self-similar case. 




Figure 4. One and two dimensional marginalised distributions (contours mark 68 and 95 per cent confidence limits) for each combination of parameters in the 
four parameter evolving Lx — T relation model (Equation[TJ, as determined using the orthogonal fitting method. Note that the luminosities have been scaled 
by E (z), and so C = corresponds to the case of self-similar evolution. 
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bracketed by those obtained using the [Keilyl p007) method, de- 
pending on the choice of the dependent variable. With T as the in- 
dependent variable, we find A = 44.42 ± 0.09 and B = 2.67 ± 0. 19, 
which is in good agreement with our bisector method, although 
with shallower slope. For T as the dependent variable, we infer 
A = 44.70 ± 0.10 and B = 3.38 ± 0.23 using the [KeUy] p007) 
method, which are in good agreement with our orthogonal method, 
although with slightly higher slope and normalisation. 

The redshift evolution of the slope is different for the two fit- 
ting methods. For the orthogonal method, the slope does not change 
significantly with redshift, with the values found for each subsam- 
ple differing by about lrj. This is consistent with the findings of 
previous studies, which have measured the slope at z > 0.4 using 
smaller samples than that used in this work (e.g. Vikhlinin et al.| 



2002l|Novicki et al.|2002[|Ettori et al.|2Tj04]|Maughan et al.|2006l 



20T2|fTakeyetal.|2011> . However, we do see flattening of the slope 
with increasing redshift in the fits using the bisector method. While 
this may be real, it is also an expected signature of Malmquist bias, 
which is discussed in Section [57X1 

The intrinsic scatter in the relation appears to decrease slightly 
with redshift (see Table|2j, although the difference in the scatter be- 
tween any two redshift bins (using either fitting method) is gener- 
ally less than 2a. This suggests there might be a decreasing fraction 
of cool core clusters at high redshift, although of course better data 
are needed to determine if this is the case. We note that a decrease 
in the scatter at high redshift could alternatively be due to selection 
effects, as shown by Reichert et al. (2011) using simulated cluster 
samples. Our intrinsic scatter estimates for the lowest redshift bin 
are consistent with the REXCESS measurement at z = 0. 1 {Pratt] 
|et al.|2009| >. 

We conclude, on the basis of these results, that it is reason- 
able to use a model with fixed slope and scatter to measure the 
evolution of the normalisation of the relation with redshift (which 
appears to evolve negatively in Fig. [2}, when using the orthogonal 
fitting method. We also present the results obtained for the bisec- 
tor method throughout, as this gives an indication of the possible 
systematic error due to the choice of fitting technique. 



4.2 Evolution of the normalisation 

We measure the evolution of the normalisation of the Lx — T rela- 
tion by fitting the four parameter model described by EquationJTJto 
the complete sample of 21 1 clusters, with C now allowed to vary. 
This is the first such measurement over this redshift range using 
clusters drawn from a single survey and analysed in a consistent 
way. Our large dataset allows us to fit for all parameters simulta- 
neously, without fixing the normalisation to that measured in a dif- 
ferent low redshift sample (e.g. Markevitch 1998 , Arnaud & Evrard 



et al. 



1999), as has often been done in past studies of this type (e.g. Ettori 



2004, Maughan et al. 2006] |Pacaud et al.|2007) . 



Since the bisector method shows a preference for shallower 
slopes at higher redshift, we focus first on the results obtained us- 
ing the orthogonal fitting method. Fig.[3]presents the Lx — T rela- 
tion for the whole sample, with E~ l (z)Lx scaled according to the 
redshift evolution inferred from the best-fitting model 

log(£ -1 (z)L x ) = (44.67±0.09) + (3.04±0.16) log(T/5) 

- (1.5±0.5) log(l+z), (7) 

with 5 = 0.085 ±0.008 (i.e., Oi ogLx = 0.27 ±0.03). Fig.|4]shows the 
one and two dimensional marginalised probability distributions for 
each parameter. We see, as expected given the model definition, that 



ixFit(z-o) = (T/5f M I 10 44 " 7 
self-similar evolution [E(z)] 
no evolution 

-1.5 ±0.5 




Figure 5. Evolution of the normalisation of the Lx — T relation relative to 
the self-similar case [E(z)], as inferred from the best-fitting four parameter 
model (Equation [7J, using the orthogonal fitting method. The shaded area 
shows the marginalised 68 per cent confidence region on the evolution de- 
rived using MCMC. The dot-dashed line shows the track for no redshift 
evolution in the normalisation of the relation. The black diamonds show 
individual XCS clusters (error bars are omitted for clarity). 



the slope, normalisation, and redshift evolution are all degenerate 
to some extent. 

As for the fits to the subsamples in redshift bins (Section |4, l| l, 
the slope and scatter are consistent with low redshift samples. The 
z — normalisation inferred from the model (A = 44.67 ± 0.09) is 
slightly lower than that found in REXCESS (44.85 ± 0.06; [Pratt] 
|et al.|20 09 ), but is consistent within less than 2a. 

We find that the redshift evolution of the normalisation is neg- 
ative (C = — 1 .5 ± 0.5), indicating that the evolution in luminosity 
at fixed temperature is significantly less than the self-similar pre- 
diction (C = 0). However, the evolution we see is within 2rj of the 
no evolution case. This is shown graphically in Fig. [3] We checked 
the sensitivity of this result to reducing the redshift range, using 
a subsample of 183 clusters restricted to z < 0.6. We find consis- 
tent results for all parameters, although the deviation of the nor- 
malisation from the self-similar prediction is not significant in this 
case (C = —0.7 ±0.9), and is also consistent with null evolution to 
within less than lo. 

In Fig. [5] we see that the highest redshift cluster in our sam- 
ple, J2215. 9-1738 at z = 1.46, has properties consistent with self- 
similar evolution. This is in contrast to our previous analysis of 
this cluster (Hilton et al. 2010), where we found it to be under- 
luminous given its temperature. This is due to the assumption of 
the Markevitch ( 1998) Lx — T relation parameters in Hilto n et al.| 
(2010) when estimating the deviation of J2215. 9-1738 from self- 
similarity. If we adopt the Lx and T measurements from Hilton 
|et al.| ( |2010| > for this system, and apply the best-fit Lx — T rela- 
tion parameters derived in this work using the orthogonal MCMC 
method (i.e. A = 44.67 ± 0.09, B = 3.04 ±0.16), then we find 
J2215. 9-1738 is well within la of the self-similar prediction. 

Repeating the analysis on the whole sample using the bisec- 
tor method, we find that the redshift evolution (C = —0.5 ± 0.3) is 
closer to self-similar than we found using the orthogonal method. 
The milder evolution seen in this case seems to be driven by the 
much lower z = normalisation found using the bisector method 
(A = 44.41 ±0.05); this is significantly (approximately 5a) lower 
than the REXCESS normalisation (see Section [4~T| above). Since 
the values of the slope and normalisation are degenerate, and we 
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Table 3. Lx — T relation fit parameters, derived from the bisector and orthogonal methods (see Section[3j, for the full XCS-DR1 cluster sample with spec- 
troscopic redshifts. In all cases, the normalisation (A) is quoted at T = 5 keV. The fits with B fixed have the slope set to the value found for the appropriate 
0.0 < z < 0.25 sample and fitting method combination listed in Table[2] In all cases, the value of C implies the evolution of the relation is below the self-similar 
expectation. 
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Orthogonal 
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°logZ. x 


Bfree: 






















(l+zf 


44.41 ±0.05 


2.64 ±0.09 - 


0.5 ±0.3 


0.35 ±0.02 


44.67 ± 0.09 


3.04±0.16 


-1.5±0.5 


0.27 ±0.03 




(l+zf 


44.38 ±0.05 


2.63 ±0.09 


0.3 ±0.3 


0.34 ±0.02 


44.65 ±0.09 


3.03±0.16 


-0.7 ±0.5 


0.27 ±0.03 


Lx 


<* E(z) c 


44.41 ±0.04 


2.65 ±0.09 


0.3 ±0.3 


0.34 ±0.02 


44.63 ± 0.07 


3.02 ±0.15 


-0.9 ±0.5 


0.27 ±0.03 


B fixed: 






















(l+zf 


44.48 ±0.04 


2.81 


0.7±0.3 


0.36 ±0.02 


44.73 ±0.07 


3.18 


-1.7±0.4 


0.28 ±0.03 


Lx « 


(l+zf 


44.46 ±0.04 


2.81 


0.1±0.3 


0.36 ±0.02 


44.70 ±0.07 


3.18 


-0.9±0.4 


0.28 ±0.03 


Lx 


~E(zf 


44.47 ±0.03 


2.81 


0.0±0.3 


0.36 ±0.02 


44.68 ± 0.06 


3.18 


-1.2±0.5 


0.28 ±0.03 



see from the results of Section l4~Tl that the bisector method favours 
shallower slopes at high redshift, we repeated the fit with the value 
of the slope fixed to B = 2.81, i.e., as found for the 0.0 < z < 0.25 
subsample (see Table[2]l. In this case, we find C = —0.7 ±0.3. We 
conclude that, regardless of the fitting method, the XCS-DR1 data 
are consistent with negative evolution of the normalisation of the 
Lx — T relation with respect to the self-similar expectation. 

Table[3]presents the fit parameters derived from the full sam- 
ple using both the orthogonal and bisector methods. We show re- 
sults for fits with B as a free parameter, and with B fixed to the 
slope found using the 0.0 < z < 0.25 subsample. For ease of com- 
parison with other works, we also list results using other common 
parametrisations for the evolution of the Lx — T relation in the lit- 
erature. 

As noted in Section [T| while there have been several previ- 
ous estimates for the evolution of the normalisation of the Lx — T 
relation, there is no consensus. Our result is in good agreement 
with the negative evolution of the relation found by Reichert et al. 
< |20 1 1| > from a heterogeneous compilation of 14 datasets, includ- 
ing the z > 0.8 XMM Distant Cluster Project (XDCP; |Fassb ender 
|et al.|20iT i sample, as well as the findings of |Ettori et al. ^2004) 
and |Clerc et al.| |2012l. Pac aud et al.H2007) find evolution consis- 
tent with self-similar from a sample of 24 clusters discovered in 
the XMM-LSS survey, after correcting for selection effects, which 
is consistent with our result given the large error bar on their mea- 
surement. Maughan et al. ( 2012 ) recently examined the Lx — T re- 
lation using a heterogeneous sample of 1 14 clusters drawn from the 
Chandra archive, and find evolution consistent with self-similar at 
z > 0.6, after excising emission from cluster cores. Several other 
studies, based on much smaller samples, have found positive evolu- 
tion, significantly different to that which we see here (e.g. |Vikhlinin| 
|et al.|2002|rLumb et al.|2004l|Kotov & Vikhlinin|2005| >, while our 
result is in mild tension with the results of No vicki et al.| {2002) 
and Maughan et al. (2006). However, as noted by many authors, 
the evolution inferred is dependent upon the choice of local Lx — T 
relation used to set the z = slope and normalisation used in these 
works. 

The main difference between the sample used here in compar- 
ison to previous works (with the exception of Reicher t'et al.|201 T) 
is the larger number of high redshift (z > 0.6) clusters, and it is 
clear from Fig.|5]that a long redshift baseline is needed to constrain 
the evolution of the relation. It will be important to take into ac- 
count both selection effects and the cluster mass function in order 
to reach a definitive conclusion. In the near future, it will be inter- 



esting to compare to measurements of the evolution of this relation 
using Sunyaev-Zel'dovich effect selected cluster samples (e.g. |An-| 
|dersson et al.|20iT| >, once the number of such objects with X-ray 
follow-up becomes large enough. 



5 DISCUSSION 

5.1 Influence of selection effects and the cluster mass 
function 

An important limitation of the analysis we have presented in this 
paper is that the selection function of the survey is not taken into ac- 
count. While modelling of the selection function for XCS has been 
performed (see Sahlen et al. 2009 LD1 1), the optical follow-up re- 
quired for confirmation and redshift measurements is not complete 
(Mehr tens et al.| |2012), meaning that it is not currently possible 
to perform a more sophisticated analysis that jointly fits for both 
cosmological and scaling relation parameters, while taking the se- 
lection function into account (e.g. |Mantz et al. 1 12010), The most 
likely selection effect that could impact our results is Malmquist 
bias. For flux-limited samples, this is well known to give shallower 
slopes, and larger normalisations, in scaling relations, if left uncor- 
rected (see e.g. Section 2.5 of the review by Allen, Evrard & Mantz 
|201 1| ), as a consequence of objects below a luminosity threshold 
being excluded from the sample. We note that the decreasing slope 
with redshift seen in the fits to the XCS-DR1 sample using the bi- 
sector method (Section [4. l\ is likely to be a manifestation of this 
bias. 

|Pacaud et al. (2007 ) investigated the effect of accounting for 
the selection function in their measurement of the evolution of the 
Lx — T relation using the XMM -LSS sample. Their sample cov- 
ers a similar redshift range to XCS-DR1 (0.05 < z < 1.05), but is 
extracted from a survey area of only 5 deg 2 , and so contains only 
24 clusters, with 7 at z > 0.6. With the selection function excluded 
from their analysis, |Pacaud et al.| |2007 ) found positive evolution of 
the Lx — T relation with respect to self-similar (C = 1 .5 ± 0.4, for a 
model of the form Lx °= (1 +zf ', i.e. without scaling the Lx values 
by £~'(z)), which is significantly different to our results (see Ta- 
ble pb. This may be due to the different depths of the two surveys, 
since XCS has searched a large number of XMM observations with 
longer exposure times than XMM-hSS (see Fig. 5 of LD1 1). How- 
ever, after accounting for selection effects, Pacaud et al. 12007]) find 
much milder evolution, which is almost exactly self-similar (al- 
though with large uncertainty). This demonstrates that inclusion of 
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the selection function in the analysis acts to drive the inferred evo- 
lution in a negative direction. Therefore it does not seem possible 
for uncorrected Malmquist bias to explain the negative evolution 
with respect to self-similar that we see. 

We have also not attempted to take into account in the analysis 
the effect of the (theoretically expected) underlying cluster distri- 
bution as a function of mass and redshift. To do this, in principle, 
we would have to assume a prior probability for the cluster temper- 
atures and luminosities, which would be a decreasing function of 
such quantities. Given that the uncertainty in our temperature es- 
timates tends to increase faster with redshift than the uncertainty 
in our luminosity estimates, it may be that the end result of taking 
into account such an effect would be a less pronounced negative 
evolution of the normalisation of the Lx — T relation. However, the 
size of this effect also depends on the full XCS selection function, 
including follow-up incompleteness, the effect of which is likely 
to mitigate this bias to some extent. We have therefore decided to 
defer a more detailed analysis which will quantify the size of this 
effect to future work, once we have a better understanding of the 
XCS follow-up incompleteness. 

5.2 Influence of cool cores, AGN, and group fraction 

Another limitation, due to the serendipitous data used in this anal- 
ysis, is that the low number of counts detected for each cluster, 
coupled to the low resolution off-axis and the high redshift of our 
sources (Section[2}, makes it unfeasible to excise the core emission 
from clusters, or divide the sample into cool-core and non-cool core 
populations (e.g. |Pratt et al.|2009||Maughan et al.|2012) . 

This could affect our results in one of two ways. On one hand, 
cool core clusters are generally easier to detect than non-cool core 
clusters, due to their increased central densities. In this case, it 
could be the case that the XCS sample includes a higher fraction 
of cool cores than the true underlying cluster population, particu- 
larly at high redshift. This seems not to be the case, because we see 
negative evolution of the Lx — T relation normalisation, and cool 
core clusters are known to have a higher Lx — T relation normali- 
sation than the non-cool core population ( |Pratt et al.|2009J . 

On the other hand, it may be that cool core clusters at high 
redshift are under-represented in our sample, due to being classi- 
fied as point sources, rather than extended objects, by the detection 
pipeline (described in LD1 1). This could then contribute to the neg- 
ative evolution of the normalisation that we see. However, simula- 
tions in which model cool core clusters are inserted into real XMM 
observations show that this is not a significant issue for objects de- 
tected with more than 500 counts (LD11). Since the sample used 
in this work contains many objects detected with < 500 counts, 
we repeated our analysis using the subsample of 108 clusters with 
> 500 counts. We find results consistent with those found from the 
whole sample (Table[3]l, with C — — 1 .6 ± 0.6 using the orthogonal 
method, and C = — 1.1 ± 0.4 for the bisector method (for a model 
in the form of Equation [TJ. We conclude that it is unlikely that a 
missing fraction of cool core clusters in our sample could explain 
our results - although it is possible that a real evolution in the cool 
core fraction could cause the evolution that we see, if cool cores are 
less common at high redshift. 

Similarly, although we are able to detect and excise point 
source emission from clusters at low redshift, this naturally be- 
comes increasingly difficult at high redshift, where it may be the 
case that contamination by AGNs is common, as the space density 
of X-ray AGNs increases significantly with redshift (e.g. Silver- 
|man et al.|20 05 1. Unaccounted for AGN contamination could have 



the effect of hardening the X-ray spectra, leading to overestimated 
cluster temperatures, which could explain the negative Lx — T rela- 
tion normalisation evolution that we see, if it affects the majority of 
the sample. These issues can only be addressed through higher res- 
olution X-ray imaging of the high redshift XCS cluster sample (e.g. 
Hilton et al. 2010); however, it will be possible to investigate these 
concerns for a small fraction of the sample with overlapping obser- 
vations in the Chandra archive. We note that a stacking analysis in 
the directions of z > 0.9 clusters by Fassbender, Suhad a & Nastasi| 
( |2012} has examined this issue using XMM-Newton data, and finds 
on average one AGN within 1 Mpc cluster-centric distance per clus- 
ter. While this level of contamination is thought unlikely to have a 
significant effect on cluster flux measurements and sample selec- 
tion, the importance of this potential bias on temperature estimates 
for objects in this redshift range has yet to be quantified. 

It may be the case that there is a break in the cluster scaling 
laws below a certain mass or temperature threshold, due perhaps 
to a change in the physics affecting the ICM between the group 
and cluster regimes (e.g. |Helsdon & Ponman|2000)|Sun et al.|2009[ 
|Stottetal.|2012) . Similarly, Maughan et al.| ( |2012| ) see evidence for 
a break in the Lx — T relation for high temperature (T > 3.5 keV), 
relaxed systems, which seem to follow a relation consistent with 
the self-similar slope (B = 2), whereas lower temperature, unre- 
laxed systems form a steeper relation. However, this effect is only 
seen after the excision of core emission, which is not something 
that can be investigated with our data. Given that our sample con- 
tains a number of low temperature (T < 2keV) systems, and that 
the fraction of low temperature systems decreases as redshift in- 
creases (Figs.[2]and[3]l, we repeat our analysis on the subsample of 
149 T > 2keV clusters. Using both fitting methods, we find that 
such a cut in temperature leads to steeper slopes (B = 3.8 ±0.3 
and B = 3.0 ± 0.2 for the orthogonal and bisector methods respec- 
tively), but does not change the inferred negative evolution of the 
Lx — T relation normalisation: we find C = — 1.8 ± 0.6 using the 
orthogonal method, and C = —0.8 ± 0.4 using the bisector method, 
both of which are in excellent agreement with the results obtained 
using the full sample of 21 1 clusters. 



5.3 Comparison with numerical simulations 

Under the assumption that neither selection effects (Section |5.1| l, 
nor a missing cool core population or significant AGN contamina- 
tion (Section |5.2} , can explain the negative evolution of the Lx — T 
relation normalisation that we see, we now consider the implica- 
tions of our results for cosmological simulations of galaxy clusters. 
We do this by comparing to several simulations, which predict sim- 
ilar Lx — T relations at z = 0, but which behave quite differently 
at high redshift, as a result of the choices made in modelling the 
heating and cooling of the intracluster medium. 



5.3.1 Simulations 

Below we briefly describe the features of the models to which we 
compare. 

CLEF ( Kay et al. 2007 1 is a hydrodynamical simulation of a 
200/z~' Mpc comoving box which includes radiative cooling and 
feedback. The latter is implemented using the 'strong feedback' 
model of Kay ( 2004 ). The amount of energy injection in this model 
effectively tracks the star formation rate - a fraction of the particles 
which pass both a density and temperature threshold are assigned 
an entropy of lOOOkeVcm 2 , which is then distributed through the 
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Table 4. Lx — T relation fit parameters, derived from the bisector and orthogonal methods (see Section [3), for simu lated cluster samples with redshift distribu- 
tion matched to the XCS-DR1 spectroscopic sample used in this work. The simulations are: CLEF I Kay et al. 2007); the Millennium Gas simulations, for which 
MG1-GO = gravity only, MG1-FO = feedbac k only, and MG1-PC = precoolin g (see Short et al. 20 10); and a new version of the Millennium Gas simulation with 
an updated feedback prescription (MG2-FO; |Short, Thomas & Young|2012) . The fitted model is of the form log(£ ,_1 (z)Lx) = A + Blog(7"/5) + Clog(l +z). 
The fits with B fixed have the slope set to the value found from an ordinary least squares fit to the simulated sample at z = 0. Note that although the two fitting 
methods give significantly different values for the slope and z = normalisation in some cases, there is good agreement between the methods on the value of 
the redshift evolution parameter, C. 



Bisector Orthogonal 
Simulation ABC °~logL x ABC °~logL x 



B free: 



CLEF 


45.31 ±0.04 


3.49±0.10 


-1 


42±0 


15 





211 ±0.011 


45 


41 ±0.06 


3.99±0.19 


-1 


25±0 


24 


0.265 ±0.019 


MG1-GO 


45.93 ±0.03 


2.67±0.10 





16±0 


10 





144 ±0.008 


46 


12 ±0.06 


3.27±0.18 





09±0 


13 


0.156±0.012 


MG1-FO 


44.80 ±0.02 


3.12±0.05 





66±0 


08 





104 ±0.006 


44 


78 ±0.02 


3.23 ±0.07 





74 ±0 


12 


0.1 13 ±0.007 


MG1-PC 


44.95 ±0.02 


3.92 ±0.06 


-2 


20±0 


06 





088 ± 0.005 


44 


96 ±0.03 


3.94 ±0.08 


-2 


19±0 


08 


0.088 ±0.006 


MG2-FO 


44.44 ±0.02 


2.68 ±0.04 


1 


82±0 


07 





094 ±0.006 


44 


48 ±0.02 


2.79 ±0.07 


1 


79±0 


10 


0.099 ±0.006 


S fixed: 






























CLEF 


45.23 ±0.03 


3.08 


-1 


54±0 


14 





212±0.011 


45 


26 ±0.04 


3.08 


-1 


68±0 


19 


0.208 ±0.0 11 


MG1-GO 


45.72 ±0.02 


2.00 





21±0 


09 





133 ±0.007 


45 


76 ±0.02 


2.00 





04±0 


13 


0.131 ±0.007 


MG1-FO 


44.79 ±0.01 


3.30 





70±0 


07 





114 ±0.006 


44 


77 ±0.02 


3.30 





82±0 


11 


0.1 13 ±0.007 


MG1-PC 


44.76 ±0.01 


3.30 


-2 


10±0 


07 





096 ±0.005 


44 


77 ±0.02 


3.30 


-2 


13±0 


10 


0.097 ±0.006 


MG2-FO 


44.48 ±0.01 


2.79 


1 


74±0 


07 





091 ±0.005 


44 


47 ±0.02 


2.79 


1 


79±0 


09 


0.095 ± 0.006 



ICM through viscous interactions and shocks. This model produces 
cool core clusters at low redshift, which disappear as redshift in- 
creases, leading to a reduction in the scatter about the relation at 
high redshift. 

The Millennium Gas project {Short e t al. 2010} is a suite 
of hydrodynamical simulations which use the same volume 
(500/i~' Mpc 3 ) and initial perturbations as the Millennium Sim- 
ulation (Springel et al. 2005). This set of simulations includes a 
gravity only 'control' model (MG1-GO); a simulation with energy 
injection at high redshift and radiative cooling (we refer to this as 
the 'precooling' model, or MG1-PC); and a simulation which incor- 
porates feedback from AGN and supernovae, implemented using a 
semi-analytic model (MG1-FO). 

The MG1-PC simulation implements preheating of the clus- 
ter gas at high redshift in a similar fashion to previous work (e.g. 
Biale k et al.|200Tj |Borgani et al.|2002| >. In this case, the entropy 
of each gas particle is raised to 200 keV cm 2 at z = 4. While this 
model is not physically plausible (only two per cent of the baryons 
form stars by z = 0, and the model is incapable of forming cool core 
clusters; Short et al. 2010), it does reproduce the Lx — T relation at 
z = (i Hartley et al.|2008) . 

The MG1-FO model includes supernova and AGN feedback 
using the scheme of Short & Thomas] ( |2009j>, where the semi- 
analytic galaxy formation model employed by |fae Lucia & Blaizot| 
l |2007) is used to infer both the star formation rate (a sink of hot 
gas) and the heating rate due to supernovae and AGN. The AGN 
feedback is implemented using the scheme suggested by |Bower| 
|et al. (|2008|> and is ca pped at two per cent of the Eddington rate 
(see |Short et al.|2010| for full details). The model successfully re- 
produces both the local Lx — T relation ( Short et al. 2010 1, and the 
Sunyaev-Zel'dovich Y — M relation ( |Kay et al.|2012| >. 

We also compare to an updated version of the Millennium Gas 
model with AGN feedback (MG2-FO). This run was performed 
using a 250h~ l Mpc box with higher resolution and updated cos- 
mological parameters (consistent with the WMAP 7-year results; 
|Komatsu et al.|20PT[l. The se mi-analytic galaxy formation model is 
also newer I Guo et al.|201 1} , and the feedback is now implemented 
in a stochastic fashion (only a fraction of the intracluster gas parti- 



cles are heated directly, whereas in the previous model the energy 
was shared throughout the cluster). The model improves agreement 
with non cool-core clusters but, like the previous Millennium Gas 
models, fails to produce the cool-core population due to the ab- 
sence of radiative cooling. Full details of this implementation are 
discussed in Short, Thomas & Young ( 2012) . 



5.3.2 Results 

All of the simulations described above provide measurements that 
are comparable to the real XCS-DR1 data. In all cases, we use to- 
tal (i.e. core included) bolometric luminosity measurements within 
/?50o, which is defined with respect to the critical density, as for the 
XCS-DR1 measurements. We note that each of these simulations 
assumes a slightly different cosmology to the one assumed in this 
work. We have neglected to correct the luminosities to account for 
this, as it is a small effect, and does not significantly affect the evo- 
lution of the normalisation of the relation, which differs substan- 
tially between the models, due to the different physical assumptions 
in each. 

All of the simulations provide spectroscopic-like temperature 
estimates (T s \; Mazzotta et al. 20041. We restrict our analysis in 
all cases to clusters with r s i > 2keV, as this is the regime in 
which the bremsstrahlung mechanism dominates, and is where the 
spectroscopic-like temperatures are most robust. Objects identified 
as satellites to more massive haloes are not included in the samples 
we draw from the simulations. 

For consistency with the analysis of the XCS data, we draw 
random samples from each simulation with redshift distributions 
matched to that of XCS-DR1 (Fig. [I}, and we fit each sample using 
both the orthogonal and bisector methods, as before. Table [4] lists 
the results. As is the case for the real data, for some simulations 
(e.g. CLEF), the bisector and orthogonal methods give different 
slopes and normalisations, with the orthogonal slope being steeper. 
However, we see that in all cases, both methods give consistent val- 
ues for the redshift evolution parameter, C. The better agreement in 
C between the two fitting methods, when used on the simulations 
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Figure 6. Evolution of the normalisation of the L-& — T relation as measured from XCS-DR1, compared with numerical simulations (indicated in the legend; 
see Section [53] and Table|4]for details). The solid thin red line is the best-fit obtained from XCS-DR1 using the orthogonal method, while the black diamonds 
show the corresponding median values for the clusters in each redshift bin (horizontal error bars indicate the redshift range of each bin, while vertical error 
bars indicate the la uncertainty in the median, estimated using bootstrap resampling). The dotted thin red line similarly represents the best-fit for XCS-DR1 
obtained using the bisector method, with the slope fixed at the z = value (2.81, see Table[3}, and the open circles indicate the median values for clusters in 
redshift bins for this set of best-fit Lx — T relation parameters. This latter fit is consistent with no evolution (dot-dashed line). The shaded regions mark the 
marginalised 68 per cent confidence regions. While the amount of evolution inferred from the XCS sample depends on the fitting technique used, there is no 
such dependence for the simulated data (see Table[4), and so we only show the results of the fits to the simulations using the orthogonal method. The XCS data 
favour negative evolution with respect to self-similar (£"(z); horizontal dotted line), and are clearly better described by the CLEF or MG1-PC models, rather 
than the models which implement AGN and supernovae feedback using a semi-analytic prescription (MG1-FO and MG2-FO). 



as compared to the real data, is likely to be due to the absence of 
selection effects in the former. 

In some cases, we find steeper slopes than were measured in 
the original works describing the simulations, most notably in the 
MG1-GO case, where B = 2 is expected. This is due to our fitting 
methods (both |Kay et al.|2007| and |Short et al.|2010| use ordinary 
least squares regression). We checked that this does not bias our 
estimates of C, by repeating the fitting with the slope (6) fixed to the 
values found in |Kay et aL] ( |2007^ and |Short et al.| ( |2010| > from the 
complete simulated samples. These results are also listed in Table|4] 
In all cases, when B is fixed, the values of C change by at most 2a 
in comparison to the fits with B as a free parameter. In no case 
do we find qualitatively different behaviour for a given model as a 
result of changing the fitting technique or fixing the slope: e.g., we 
find negative evolution of the Lx — T relation in CLEF for all the 
variations. 

Fig. [6] shows a comparison of the redshift evolution in the 
simulations with the results from the XCS-DR1 sample, where we 
show the results for both the orthogonal fitting method, and the bi- 
sector method with slope fixed to the z = value (see Section |4~Tj 
and Table [3j. This gives an indication of the systematic uncer- 
tainty in the XCS-DR1 measurement due to the choice of the fitting 
method. We see that in either case the XCS-DR1 data are closer 
to the CLEF and MG1-PC simulations, in which the Lx — T nor- 



malisation evolves negatively with respect to self-similar, and are 
more than 5a away from the evolution predicted in the MG1-FO 
and MG2-FO simulations (irrespective of the fitting method used 
on the XCS-DR1 data). 



The key difference between the feedback models in the simu- 
lation is the epoch at which most of the energy injection occurs. In 
the MG1-PC model, all of the energy input occurs at z = 4, which 
is not likely to be physically reasonable, but serves as a useful ex- 
treme test. In the CLEF simulation, the energy injection occurs over 
a broad range of redshifts, but is skewed to early times, as it directly 
tracks the star formation rate (around two thirds of the stars have 
already formed, and energy injected, by z = 1). Finally, in the MG1- 
FO and MG2-FO simulations, the dominant AGN feedback occurs 
later, when the black holes have grown to sufficient mass to act as 
powerful energy sources. Therefore, the lack of agreement with the 
observations suggests that feedback at high redshift is too ineffi- 
cient in the current models. We note also that radiative cooling is 
not currently implemented in these simulations, and therefore the 
cold gas mass growth rate in the semi-analytic model is not fully 
self consistent with the hydrodynamical simulation. 
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6 CONCLUSIONS 

We have investigated the evolution of the Lx — T relation since 
Z ~ 1.5 using a sample of 211 spectroscopically confirmed X-ray 
clusters drawn from the first XCS data release (Mehrtens et al. 
|2012| >. This is the first such measurement over this wide redshift 
range using a single, homogeneous sample. We find: 

(i) Using both an orthogonal and bisector fitting method, the 
slope of the Lx — T relation for the z < 0.25 subsample of XCS- 
DR1 clusters is consistent with that found for the REXCESS sam- 
ple (Pratt et al. 2009). The normalisation is slightly lower, but con- 
sistent within 2a, using the orthogonal method, although we find 
a 5a lower normalisation using the bisector method. This may be 
in part due to differences in the spectral fitting, or could be due to 
differences in the sample selection. 

(ii) From dividing the sample into redshift bins, using the or- 
thogonal fitting method, we see no evidence for evolution in either 
the slope or intrinsic scatter as redshift increases - both are consis- 
tent with previous measurements at z = 0.1. We see a flattening of 
the slope at high redshift when using the bisector fitting method, 
which could be a signature of the effect of Malmquist bias. 

(iii) Regardless of the fitting method, our data shows that the 
normalisation of the relation evolves negatively with respect to 
self-similar. For the orthogonal method, we find the evolution is 
E- l (z)L x = io 44 - 67 ± °9(r/5) 3 - 04±0 - 16 (l+z)- L5±0 - 5 , which is 
within 2a of the zero evolution case. Using the bisector method, 
with the slope fixed to the value found for the z < 0.25 subsample, 
we find E- l (z)L x <* T 2 ' 81 (l + z)~°' 7±a3 . Malmquist bias would 
have the effect of driving the normalisation in the positive direction, 
and so cannot explain this result. It is possible that a deficit of cool 
cores in the XCS-DR1 sample, or significant AGN contamination 
at high redshift, may contribute to the negative evolution that we 
see. The former seems unlikely, given that a higher signal-to-noise 
subsample gives consistent results to those obtained using the full 
sample, while the latter can only be tested using higher resolution 
X-ray data. 

(iv) From comparison with numerical simulations, we find the 
XCS-DR1 data favour feedback models in which the majority of 
the energy injection occurs at high redshift. AGN feedback models 
based on current semi-analytic galaxy formation model prescrip- 
tions, as used in the Millennium Gas project, predict positive evo- 
lution with respect to self-similar, and differ from the XCS-DR1 
measurements at the > 5a level. This suggests that feedback at high 
redshift in these models is too inefficient. 

A more sophisticated analysis to jointly constrain both cosmo- 
logical and scaling relation parameters, taking into account both a 
model of the survey selection function and the cluster mass func- 
tion, will be possible with improved redshift completeness. We are 
also pursuing velocity dispersion measurements of the high redshift 
XCS cluster sample, and will explore the evolution of the scaling 
of X-ray observables with dynamical mass in future work. 
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